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Whereas in standard quantum state tomography one estimates an unknown state by performing 
various measurements with known devices; and whereas in detector tomography one estimates the 
POVM elements of a measurement device by subjecting to it various known states, we consider here 
the case of SPAM (state preparation and measurement) tomography where neither the states nor 
the measurement device are assumed known. For d-dimensional systems measured by d-outcome 
detectors, we find there are at most d 2 (d 2 — 1) “gauge” parameters that can never be determined by 
any such experiment, irrespective of the number of unknown states and unknown devices. For the 
case d = 2 we find new gauge-invariant quantities that can be accessed directly experimentally and 
that can be used to detect and describe SPAM errors. In particular, we identify conditions whose 
violations detect the presence of correlations between SPAM errors. From the perspective of SPAM 
tomography, standard quantum state tomography and detector tomography are protocols that fix 
the gauge parameters through the assumption that some set of fiducial measurements is known or 
that some set of fiducial states is known, respectively. 


I. INTRODUCTION 

Quantum tomography has become an important tool 
for characterizing quantum devices [ 10 . For example, 
in order to estimate the state p of quantum systems pro¬ 
duced by a given quantum source, we let the source cre¬ 
ate many copies of p , and subject those copies to different 
measurements. If we describe the whole set of measure¬ 
ments by POVM elements Ip (such that H = 1), the 
experiment will produce estimates of probabilities 

Pi = Tr(plh), (1) 


measurement device, whose observed outcomes now tell 
us about the POVM elements that describe the detector. 
Detector tomography makes sense: A photodetector, for 
example, is not such a simple device theoretically. There 
are infinitely many modes of the electromagnetic field, 
and each mode is described by an infinite-dimensional 
Hilbert space. A simple candidate model for a photode¬ 
tector can be formulated in terms of a quantum efficiency 
rj and a dark count rate r, but these two parameters could 
drift over time and be field-mode dependent. In the case 
of detector tomography, we estimate probabilities of the 
form 


with YliPi = 1- If we have at least as many observed 
independent relative frequencies /,; (the number of times 
outcome i was observed divided by the total number of 
measurements) as there are parameters in p, then we may 
estimate p via a variety of different methods (from linear 
inversion to hedged maximum likelihood estimation and 
Bayesian methods, see, e.g., 0 - 0 ). Legitimately ending 
up with an estimate of a single density matrix p does re¬ 
quire some assumptions about the experimental setup 0- 
[8j], namely, invariance under permutations of the copies. 
In particular, even if there is drift (for example, due to a 
slowly changing magnetic field or a slowly changing phase 
of the laser field used to prepare our quantum systems), 
such that copy p n differs slightly from the previous copy 
p n ~ i, we still obtain an estimate of a single density ma¬ 
trix, namely, the average: 

1 Nt 

P = ~N t H Pn ' ( 2 ) 

n= 1 

in an experiment with N t copies of states in total, pro¬ 
vided the different measurements are performed in ran¬ 
dom order. 

In more recent times several experiments on detector 
tomography have been performed [M|. Here the as¬ 
sumption is that known states are fed into an unknown 


Pn = Tr(p M n), (3) 

for different known states (where p runs from 1 to M), 
in order to estimate the POVM element n, where the 
latter is really an average (under the proviso that we test 
the measurement device with different states created in 
random order, to ensure symmetry under permutations 
of the measurements) over the course of the experiment of 
the different POVM elements H n describing the drifting 
detector. In complete analogy to 0 we may write then 


N t 


H - N t ^ Un ' 

n= 1 


( 4 ) 


if we performed this particular measurement N t times in 
total. 

The present paper focuses on two questions: First, 
what exactly can we find out about measurements and 
states when we do not assume we know either one or the 
other? That is, what can we infer about the POVM ele¬ 
ments H; describing different measurement settings and 
about different states from estimates of probabilities 

Pn,i = Tr(p M nQ? (5) 

Second, what changes if the fluctuations (or drift) in 
state preparation and measurement are correlated? That 


is, what if the observed average relative frequencies are 
not determined by averaged states and averaged mea¬ 
surements? How can we infer there are correlations such 
that 


TrO^Hi) 7^ T^IIO? (6) 

Here the left-hand side is a quantity for which we ob¬ 
tain direct estimates from observed relative frequencies, 
but the two quantities p M and on the right-hand side 
are not directly accessible individually and have to be 
inferred. 

Such questions have become of interest in the con¬ 
text of debugging quantum devices that are meant to 
serve as fault tolerant quantum computers. The require¬ 
ments on fault tolerance are quite stringent and more and 
more precise tools for analyzing tomography experiments 
have been developed very recently [13Hl5l |. Correlations 
between errors are particularly bad for fault tolerance, 
hence our focus on detecting correlated SPAM errors. 

We are going to analyze SPAM tomography for the 
case of a single qubit and two-outcome POVMs. Even 
for this simple case our results suggest novel experiments. 
Most results generalize easily to multiple qubits and/or 
higher-dimensional systems and measurements, and in 
particular to the combination of d dimensional systems 
and d-outcome POVMs. 


II. SPAM TOMOGRAPHY 
A. Notation 

To set the notation, let us first consider the simplest 
version of our problem: a single qubit state, p, and a sin¬ 
gle two-outcome qubit detector, with the two outcomes 
described by POVM elements { E , ~<E} (read: E and 
NOT -E). From the conditions p > 0, E > 0, —■ E > 0, 
Trp = 1, and E + ->E = U, a general parameteri¬ 
zation can be written in terms of the Pauli matrices 
a = ( a x ,a y ,a z ) T as 

P = ^(1 +P-&), 

E = -^(1 + u)l + w ■ a'j, 

- 1E = ^((l-w)l - w-ct), (7) 

where positivity is ensured by the inequalities 

IpI < i, 

\w\ + |u| < 1. (8) 

Altogether there are seven parameters. Three of these 
correspond to a choice of coordinate system (when we 
think of the qubit as a spin-1/2 system, we have to specify 
what we mean by the components of spin in the x, y, z 
directions), or, equivalently, to a choice of basis for the 


2D Hilbert space describing our qubit. The other four 
parameters can be thought of as state purity (|p|), the 
detector’s discrimination power (|ic|), detector bias (u), 
and the alignment (or fidelity) between detector and state 
(p ■ w). We may conveniently represent the two-outcome 
POVM by a single observable: 

E = E — -i E = ut + w ■ a. (9) 

Knowledge of its expectation value S in the state p gives 
us a single constraint on these seven parameters: 

S = TrpE = p ■ w + u. (10) 

B. Counting parameters 

Now consider the more interesting case where we per¬ 
form two-outcome measurements with a variety of states 
and observables. We press the p-th button of a quan¬ 
tum source, where p, runs from 1 to M, representing the 
preparation of state p^, while the dial of the measure¬ 
ment device is turned to the i -th setting (where i runs 
from 1 to TV), symbolizing the observable E,;. The out¬ 
come of measurement i is either that an indicator light 
blinks (Ei) or does not blink (-ids*). 

Considering the number of undeterminable parame¬ 
ters, 7 , a first guess might be that there are 

7 = (# unknowns) — (# constraints) = 3 M + 47V — NM 

( 11 ) 

of them, where we assume we always subject all M states 
to all TV measurements. This kind of counting is correct 
so long as either TV < 3 or M < 4 but otherwise it fails 
to take the following important observation into account: 
For M = 4, there are sufficiently many states to perform 
complete detector tomography on any number of detec¬ 
tors. Similarly, N = 3 would be a sufficient number of 
2 -outcome detectors to perform complete state tomogra¬ 
phy. Hence for any N > 3 and M > 4, there are just 
as many undeterminable parameters as when N = 3 and 
M = 4, because the addition of any states (or detectors) 
could reveal no more information about the detectors (or 
states) than the “original” 4 states (or 3 detectors) al¬ 
ready yield. (This argument generalizes to d-outcome 
measurements performed on d-dimensional systems: each 
d-outcome measurement yields d— 1 independent frequen¬ 
cies. d + 1 known d-outcome POVMs are, therefore, suffi¬ 
cient to determine the d 2 — 1 parameters of any unknown 
state. Conversely, d 2 known states (each needing d 2 — 1 
parameters for its specification) are needed to determine 
the d 2 parameter of any unknown POVM element. Thus, 
the maximum number of gauge parameters is d 2 (d 2 — 1 ).) 
Therefore the number of undeterminable parameters is 

3M + m-NM : TV < 3 or M < 4 , . 

12 : TV > 3 and M > 4 

(From now on we assume we do have at least 4 states 
and at least 3 detectors, so that the number of undeter¬ 
minable parameters is 12.) In order to find what these 



undeterminable parameters are we first introduce some 
convenient notation. First of all, the set of all constraints 
can be written succinctly as a matrix equation, 


S = PW 


where 


and 




(13) 


(14) 


(15) 


are the N x 4 and 4 x M matrices of state and detector 
parameters, respectively, where we included a constant 1 
(which stands for Tr(p^)) in the set of state parameters 
just to make the treatment of states and detectors more 
symmetric. The form of equation m makes obvious 
that the system of constraints has the symmetry: 

P— >PG~\ W — >GW, (16) 


where G is of the general form 


H 0 
a T 1 


(17) 


where H is any real 3x3 invertible matrix, H e GL(3), 
and a is any real 3-vector, Sel 3 . The collection of all 
matrices G £ R 3 x GL( 3) forms the affine group. 


C. Blame Gauges: A New Perspective on 
State/Detector Tomography 


Some of the 12 parameters determining G should be fa¬ 
miliar. Most important are perhaps the three parameters 
needed to describe matrices of the form 


G = 


R 0 
0 1 


(18) 


where R is in the adjoint representation of SU(2) (which 
here is just SO(3).) This space of symmetries is al¬ 
ways present (also for higher-dimensional Hilbert spaces 
of size d where we would have the adjoint representation 
of SU(d)), and corresponds to the arbitrary choice of ba¬ 
sis for the underlying Hilbert space. This sort of sym¬ 
metry tends to be trivial, but becomes nontrivial (and 
interesting) in the context of quantum communication 
(the information to fix a reference frame was coined “un¬ 
speakable” by Asher Peres j46j, for the reason that two 
distant parties cannot communicate the choice of coordi¬ 
nate system over the phone, unless they already share a 
reference frame). 

Another simple undeterminable parameter is revealed 
by considering G of the form 


G = 


5I3 

0 


0 

1 ’ 


(19) 


where g £ R. This kind of scaling transformation 
trades overall state purity for the detector’s discrimi¬ 
nation power. That is, imperfections may be blamed, 
within some limits, more on the preparation of the states 
than on the quality of our detectors or vice versa [l8| . Let 
us refer to parameters like this as blame gauge degrees of 
freedom. 

We can characterize all 12 blame gauge parameters as 
follows. Every H (as appearing in (fl7l) l has a polar de¬ 
composition H = BR where R is real orthogonal and 
B is positive symmetric. This yields the following in¬ 
terpretation: We can always diagonalize B: the three 
eigenvalues are like the scale parameter g of (TT9l) . The 
basis in which B is diagonal (determined by three addi¬ 
tional parameters) determine which components of p and 
w can be rescaled in concert. 

The three parameters of R still correspond to the 
choice of Hilbert space basis (or of a coordinate system) 
as in (fl8l) [l9j . 

Finally, the last three parameters feature in transfor¬ 
mations of the form 


1 3 0 

a T 1 ’ 


( 20 ) 


where a is a real 3-vector. It is perhaps clearer what this 
transformation does when we write out the action on the 
system parameters explicitly: 

{ Pu —> Vu ~ « 

uii —>• Wi ( 21 ) 

Ui —> Ui + a ■ Wi. 

We see that this exchanges state-detector alignment for 
detector bias, and thus is another type of blame gauge. 

Having now completely classified the 12 gauge degrees 
of freedom in qubit SPAM tomography, let us briefly re¬ 
visit the more conventional types of (qubit) tomography. 
Having at our disposal M = 4 fiducial states assumed to 
be fully known allows for complete detector tomography 
precisely because we can write in that case W = P~ 1 S. 
Similarly, N = 3 known fiducial detectors allows us to 
perform complete quantum state tomography because we 
can write in that case P = SW~ X , where W~ x is to be 
interpreted as a right (pseudo-)inverse. 

From the perspective of SPAM tomography we see, 
therefore, that the assumptions underlying state and de¬ 
tector tomography simply boil down to fixing the blame 
gauge. In state tomography on qubits one fixes four pa¬ 
rameters each of three fiducial measurements, and in de¬ 
tector tomography one fixes four parameters each of three 
fiducial states. 


III. DETECTING CORRELATED ERRORS 

A. Uncorrelated States and Detectors 

A measurement yields averages of the form 

S^i = (Ei)^ = • Wi + m, 


( 22 ) 
















which contains two types of averages: one quantum ex¬ 
pectation value (in the state p M , denoted as usual by 
(.)), and one average over the different runs of the ex¬ 
periment (indicated by the over line). The existence of 
a model (P, W) that satisfies Eq. (fl3l) is precisely the 
statement that a system consists of effectively uncorre¬ 
lated states and detectors. The point is now that, cer¬ 
tainly for M > 4 and N > 3, not every possible set of 
data can be effectively uncorrelated simply because the 
number of data parameters is larger than the number of 
distinct uncorrelated model parameters whenever 

MN > 3M + 4iV - 12. (23) 

We will now answer the question under what conditions 
there is an uncorrelated model and when there is not. 


Q ± i does not exist either.) We will refer to the left hand 
side of (®D as a partial determinant of S and write 

A (S) = Qii Q 12 Q 22 Q 21 ■ (30) 

One should note that the ordering of the rows of P and 
columns of W was arbitrary and any such reordering 
would result in a different partial determinant. Note that 
a partial determinant can be defined for values of N and 
M as low as 5 [20]. 

The important conclusion is that the condition that 
AS = I 4 can be used to detect correlations: a violation of 
this equality (while taking into account error bars) shows 
that state preparation and measurement must have been 
correlated. 


B. Constraints for Uncorrelated Quantum Data: 
Partial Determinants 


Suppose we have quantum data S that is effectively 
uncorrelated, i.e., described by a model (P, W). For the 
sake of presentation, let us consider first the case N = 
M = 8 . Think of the data as partitioned into four 4x4 
corners of an 8 x 8 matrix, 


S = 


Q 11 Q 12 
Q 21 Q 22 


such that 

where 

— that is, 

Pi = 


Qab = PaWb 


P = 


Pi 

P 2 


; W=[W X W 2 ] 


Pi 

1 


P 4 

1 


1 T 


P 2 = 


Pb 

1 


P 8 
1 


1 T 


are the top and bottom halves of P and 


Wi = 


wi ■ ■ ■ w 4 
Ui ■■■ U 4 


Wo = 


w 5 •• • w 8 
u 5 ■■■ u 8 


(24) 


(25) 


(26) 


(27) 


(28) 


are the left and right halves of W. 

These submatrices of the data, because of Eq. (|25]>. 
have the property 

Q 11 Q 12 Q 22 Q 21 = I 4 , (29) 

under the assumption that the two inverse matrices on 
the left-hand side exist. (For example, when state 1 and 
state 2 are the same, or measurements 1 and 3 are the 
same, Q j^ 1 does not exist. Also, if one of the four vectors 
Pi for i = 1,2, 3,4 is a mixture of the other three, then 


C. Nonsingular Partial Determinants 


To simplify notation, instead of using Q a bS , let us write 
the partitions of the data, S, simply as 


A B 
C D 


(31) 


in which case we have 


A (5) = A~ 1 BD~ 1 C. (32) 


An awkward feature of this quantity is that it may dis¬ 
play singular behavior because of the presence of inverse 
matrices. A simple way to remedy this is to define a 
nonsingular partial determinant [ 211 ] 

V(S) = det(B)A(A - l) = BD T C - (det D)A (33) 


where D is the cofactor matrix of D so that, for example, 


A- 1 


det A 


A 1 


(34) 


We chose to multiply by A simply to reduce computa¬ 
tional complexity. This then gives rise to another (non¬ 
singular) condition on having uncorrelated SPAM errors. 
It has the disadvantage that satisfaction of the condition 
V(S) = 0 may occur not because one’s SPAM errors are 
uncorrelated, but because of having multiplied both sides 
of (EH 1 ) with zero. 


D. Estimates and Uncertainty for the Partial 
Determinant of Qubit Data 

We consider here briefly error bars in the partial deter¬ 
minant, which are needed to decide whether or not one’s 
data provides significant evidence for correlated SPAM 
errors. 

The data is estimated to maximize the likelihood of 
MN binomial distributions: 

(S)^ = (S^) = 2^A _ 1 

'Ifii 


( 35 ) 

















where n M j is the number of trials for the fi -th state and 
*-th detector settings and k^i is the number of positive 
outcomes. As such, the deviation of values from the ac¬ 
tual mean is 

(SS^) 2 = (S^ 2 ) - (S^) 2 = 1 ~ S ^, (36) 

We can then propagate these uncertainties to the non¬ 
singular partial determinant, 

V(S') = BD t C — (det D)A. (37) 

Assuming independent Gaussian random variables, the 
relative coefficients for the qubit case are 

= S ac S bd det D (38) 

oA cd 


dVab 

dB cd 


^ac^ijk^dlmCibBjlBkm 


(39) 


dV ab 

dC cd 


3 bd,£ cij ^ klm B a k D n Dj m 


(40) 


Qj-y — E-ijc^-kld ^ B a kDjiCib jiA a b^ (41) 

where 6 and e are the Kronecker and Levi-Civita symbols, 
reps., and repeated indices are to be summed over. If we 
choose N = M = 8, then the subdata can be chosen 
to be statistically independent and the error bar in the 
partial determinant is then simply 



Otherwise, if a smaller number of states and detectors is 
considered, one must take into account that the entries 
are not all statistically independent when calculating er¬ 
ror bars. 


E. Examples of Correlated Models 

A significant feature of these partial determinants is 
that they can be estimated directly from measured data. 
Stressing the point explicitly: “V(5) = 0” may serve as 
an operational definition for S to have no state-detector 
correlations. Certainly, if one finds that V(S') ^ 0 with 
high confidence, then one has made a definitive statement 
that there are state-detector correlations. 

It is perhaps helpful if we illustrate the concept of cor¬ 
related errors with a couple of simple models. For the 
sake of simplicity, let us restrict our attention to mea¬ 
surement devices whose biases iq may be chosen to all 
vanish. 


1. Gaussian model 

A first way to model correlation is to treat the Bloch 
vectors for all states and detectors as randomly fluctuat¬ 
ing: 


TV — TV° A fu 
m = w io + cji 


(43) 


where the and Wi a are constants and / M and g r are 
Gaussian noise with 


(U) = ( 9i) = fl- 

(44) 

Our data is then of the form: 


5 = P 0 W 0 + X 

(45) 

where 


= (f \j ■ gi) 

(46) 

is the scalar noise correlation. 

As a simple special case, consider X 
1. We then have 

= yls where x <C 

B n] + x 1 »=[ A+ c 1 ‘ 

d + x u] (47) 


where A 1 BD 1 C = A(P 0 Wo) = 1 4 so 

A -1 = 1 4 + x(C _1 I? _1 A + C~ 1 DB~ 1 ) 

and 

V = -x(det D) (A(PC)- 1 A - 1 4 ). (48) 

As expected, the entries of V are linear in the size of the 
correlations, x- 


2. Causal correlations 

A second model considers data of the form 

Spi = Pfi ■ w i{lJ (49) 

where the states, p iL , are constants but each detector has 
a value conditioned on the state they are detecting. Cor¬ 
relations of this type are causal in the sense that the 
state preparation event is considered to be prior to the 
detection event and therefore the former cannot depend 
on the latter, but the latter can depend on the former. 
More specifically, one could have, for example, 

Wi\» = w io + XPu (50) 

where Wio is fixed and x ■C 1 is a sort of linear suscepti¬ 
bility. In this case, the data is of the form 

S = PW 0 + Y (51) 















where 


P = 

Wo = 


Pm 

1 

Wio 

0 


Y^i = x\Pm\ 2 - 


(52) 


We can, obviously, construct additional more com¬ 
plex correlated models containing larger numbers of ad¬ 
justable parameters. Fitting actual data to the different 
models and applying model selection would allow us to 
find the best description of the correlations present in our 
experiment (as explained further in Ref. [171 ] in a similar 
context). 


IV. CONCLUSIONS 


detectors. These simplifications allowed us to fully ana¬ 
lyze errors in SPAM and especially correlations between 
such errors. We found a simple test for detecting such 
correlated errors in terms of a “partial determinant” (see 
Eq. (1291) or in terms of a nonsingular version, see Eq. 
both of which are determined directly by the observed 
frequencies of the outcomes of the M x N different mea¬ 
surements. Such conditions can be written down as soon 
as M > 5 and N > 5. 

Finally, by considering the full set of gauge parameters 
(i.e., parameters not determinable by any (set of) SPAM 
experiment (s)), we found a novel interpretation of stan¬ 
dard quantum state tomography and of the more recently 
developed detector topography: both can be seen as fix¬ 
ing the gauge parameters in their own particular way: the 
former by assuming some set of fiducial measurements is 
fully known, the latter by assuming some set of fiducial 
states is fully known. 


We considered a simple version of “gate-set tomogra¬ 
phy” p~3j] , in which only state preparations and measure¬ 
ments (SPAM) are performed, and no gates in between. 
As a special case we considered in detail qubits in M 
unknown states measured by N unknown two-outcome 
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